Goto

Collaborating Authors

 arabic speech mispronunciation detection dataset


ASMDD: Arabic Speech Mispronunciation Detection Dataset

arXiv.org Artificial Intelligence

The largest dataset of Arabic speech mispronunciation detections in Egyptian dialogues is introduced. The dataset is composed of annotated audio files representing the top 100 words that are most frequently used in the Arabic language, pronounced by 100 Egyptian children (aged between 2 and 8 years old). The dataset is collected and annotated on segmental pronunciation error detections by expert listeners.